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Abstract. 

The Information Flow Framework (IFF0 is a descriptive category metatheory currently 
under development that provides an important practical application of category theory. 
It is an experiment in foundations, which follows a bottom-up approach to logical de- 
scription. The IFF forms the structural aspect of the IEEE P1600.1 Standard Upper 
Ontology (SUO) project. The categorical approach of the IFF provides a principled 
framework for the modular design of object-level ontologies. The IFF represents meta- 
logic, and as such operates at the structural level of ontologies. In the IFF, there is 
a precise boundary between the metalevel and the object level. The modular archi- 
tecture of the IFF consists of metalevels, namespaces and meta-ontologies. There are 
four metalevels, Lower, Upper, Top and Ur, corresponding to the set-theoretic distinc- 
tions between the "small" , the "large" , the "very large" and the "generic" , respectively. 
Each metalevel services the levels below by providing a metalanguage used to declare 
and axiomatize those levels. Corresponding to the four metalevels are the four nested 
metalanguages, meta-lower □ meta-upper □ meta-top □ meta-ur, where each metalan- 
guage axiomatization includes specialization of the one immediately above. Within each 
metalevel, the terminology is partitioned into namespaces, and various namespaces are 
collected together into meaningful composites called meta-ontologies. The IFF contains 
meta-ontologies representing category theory, information flow, formal concept analysis, 
institutions, multitudes, equational logic, first order logic, the common logic standard, 
etc. All of the various meta-ontologies in the IFF are anchored to the IFF metastack 
Set C Cls C Col C Ur. The IFF development is largely driven by the principles of 
conceptual warrant, categorical design and institutional logic. The main application of 
the IFF is institutional: the notion of institutions and their morphisms are being ax- 
iomatized in the upper metalevels of the IFF, and the lower metalevel of the IFF has 
axiomatized various institutions in which semantic integration has a natural expression. 

" Philosophy cannot become scientifically healthy without an immense technical vocabulary. 
We can hardly imagine our great-grandsons turning over the leaves of this dictionary with- 
out amusement over the paucity of words with which their grandsires attempted to handle 
metaphysics and logic. Long before that day, it will have become indispensably requisite, too, 
that each of these terms should be confined to a single meaning which, however broad, must 
be free from all vagueness. This will involve a revolution in terminology; for in its present 
condition a philosophical thought of any precision can seldom be expressed without lengthy 
explanations." — Charles Sanders Peirce, Collected Papers 8:169 
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1 Several years ago there was a discussion thread on the SUO email list, entitled "composing ontologies 
using morphisms and colimits" and involving Michael Uschold, John Sowa, and others. This discussion 
was concerned with the question of the use of category theory (versus set theory) in the IFF project 
as a foundation for knowledge representation and logic. Michael Healy forwarded this question to the 
category email list. This paper can be regarded as an extended answer to this question. 
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1. Introduction 

Recall reading the classic Categories for the Working Mathematician (CWM) [12] by 
Saunders Mac Lane. This standard reference offers an introduction to the basic concepts of 
category theory, such as the categories^ "category", "functor", "natural transformation", 
"adjunction", "monad" and "(co)limit", and the relationships (some functional) "object", 
"morphism" = "arrow" (terms are equal when they are synonyms), "source", "target", 
"equivalent to" , "isomorphic to" , etc. CWM discusses a collection of well-defined concepts 
such as these, each being represented by a word or phrase in natural language augmented 
by mathematical notation. In addition, CWM contains a collection of logical expressions 
used to represent its meaning. The logical expressions of CWM are built up from its 
categories and relationships using logical connectives and quantifiers, or are expressed 
using commutative diagrams. For example, consider a page chosen at random from CWM, 
such as page 78. Some of the categorical concepts encountered on this page include: "free 
category", "small graph", "functor", "forgetful functor", "U : Cat -> Grph" = "the 
forgetful functor from the category of small categories to the category of small graphs", 
"morphism" , "unique map" , "category" , "natural isomorphism" , "Set" = "the category of 
small sets", "function", "bijection", "natural", "isomorphism", "adjunction", "bifunctor", 
u X op xA -)■ Set" = "a Set- valued bifunctor", "pair of objects", "horn-set", etc. Some of 
the logical expressions encountered on this page include: "for all modules A, B, and C 
over a commutative ring K, there is an isomorphism Hom(v4(g)^.B, C) = Hom(A, B®k C)" 
and "for all v4-morphisms k : a — > a' and all X-morphisms h : x' — > x the diagrams 

y^cc fx ^j^oc a 

A(Fx,a) —~X(x,Ga) A(Fx,a) —~X(x,Ga) 



A(Fx,k) 



X(x,Gk) A(Fh,a) 



X(h,Ga) 



A(Fx, a') -^X(x, Go!) A{Fx' , a) — -X{x', Go) 

commute". Some questions immediately arise. Can CWM be formalized, either approx- 
imately or fully? Should it be formalized? What meaning is captured by formalization? 
What meaning is missed? Should we wait for a more mature foundations of category 
theory before formalizing CWM1 Would a formalization of CWM be itself a candidate 
foundations of category theory? This paper discusses an effort called the Information Flow 
Framework (IFF) jH] , which can be viewed as an approximate formalization of CWM, since 
a substantial portion of CWM has already been axiomatized in the IFF, but which was 
originally formulated to be a descriptive category metatheory for object-level ontologies. 



2. Ontologies 

Since the IFF is being offered as a metatheory for ontologies, it is appropriate to ask the 
question "What is an ontology?". The term "ontology" was introduced in 17th century 

2 The philosophical notion of category. After all, the term "category" in "category theory" was pur- 
loined from Aristotle and Kant. 
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Figure 1: An e-commerce ontology fragment 



philosophy to represent the study of being, but had origins in Aristotle's metaphysics. In 
the 20th century, the term "ontology" was borrowed by the artificial intelligence commu- 
nity to represent a collection of things that exist. The meaning has since evolved, and 
now represents conceptual modeling and knowledge engineering. The various kinds of on- 
tologies form a spectrum according to formality, with catalogs and glossaries at one pole, 
taxonomies in the middle, and formal axiomatic theories at the other pole. One popular 
definition is that "an ontology is a formal, explicit specification of a shared conceptual- 
ization. It is an abstract model of some phenomena in the world, explicitly represented 
as concepts, relationships and constraints, which is machine-readable and incorporates 
the consensual knowledge of some community." Since it is an abstract model of some 
phenomena in the world, it is a semantic conception. Because it uses concepts, relation- 
ships and constraints, it is logic-oriented. Because it is machine-readable, it is formal and 
explicit. And since it incorporates the consensual knowledge of some community, it is 
shared and relative. Figure [1] illustrates a fragment of an E-commerce schema (ontology) 
taken from the papeJl [2], where concepts are represented as nodes, relationships are rep- 
resented as edges, and constraints are represented as parallel pairs of edge paths, limits 
and coproducts. 

We distinguish between object-level and metalevel ontologies. An object-level ontol- 
ogy represents some aspect of the "real world". Three levels of object-level ontology 
have been mentioned. Lower object-level (domain) ontologies represent the conceptual 
structures of some community; e.g., physics, chemistry, biology, government, business, 
etc. One example of a domain ontology is a physics ontology, consisting of the organiza- 
tion of the community of physicists, a collection of physical subtheories, such as gravity, 
heat, motion, electricity, magnetism, . . . , and technical terminology. Another example of 



3 This paper by C.N.G Dampney and Michael Johnson suggests regarding category theory as a meta- 
ontology, which fits exactly the spirit of the Information Flow Framework (IFF) project. 
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a domain ontology is The Gene Ontology (GO), an actual bio-ontology for the genetics 
community, consisting of the organization of the community of geneticists, subtheories, 
such as molecular-function, biological-process, cellular-component, . . . , and technical ter- 
minology. Middle object-level ontologies help organized domain ontologies by axiomatiz- 
ing some abstract facet, such as the conceptual structures of organizations, the model 
of an enterprise, the nature of governments, a template for educational institutions, the 
mathematical structures of events and processes, .... Upper object-level ontologies are 
limited to concepts that are meta, generic, abstract and philosophical. Upper-level on- 
tologies contain the most general classifications of entities. Hence, they tend to be domain 
independent and possess high reusability. Examples of upper-level ontologies include the 
WordNet online lexical database, the Standard Upper Ontology (SUO), the Cyc upper 
ontology, the High Performance Knowledge Base project (HPKB), the Suggested Upper 
Merged Ontology (SUMO), .... Of course, there may be overlap between domain and 
middle-level ontologies, and between middle-level and upper-level ontologies. And often, 
upper-level ontologies contain meta-information. 

A meta-level ontology is an ontology about ontologies — it represents some aspect of 
the organization of object-level ontologies. In this paper, we identify meta-ontologies and 
metatheories. However, object-level ontologies and theories are not identical, since object- 
level ontologies can be either populated or not. We identify unpopulated object-level 
ontologies with theories [aka schemas], and we identify populated ontologies with (local) 
logics [aka databases]. Unpopulated ontologies have only type information. Populated 
ontologies have both type and instance information, plus the classification relationships 
between these two kinds of things. This paper is about a metatheory called the Infor- 
mation Flow Framework (IFF), which is descriptive and category-theory-based. The fact 
that the IFF is a descriptive metatheory involves use of the constraint called "conceptual 
warrant". The fact that the IFF is a category-theory-based metatheory involves use of 
the motivator called "the categorical design principle" . 



3. Descriptive Metatheories 

Distinctions are important in ontology development. Two distinctions in particular are 
important for the IFF: the monolithic versus modular distinction and the prescriptive 
versus descriptive distinction^. However, these distinctions are sometimes confused - 
although prescriptive approaches are often monolithic and descriptive approaches are 
often modular, these two distinctions are conceptually different. A monolithic ontology is 
one-size-fits-all. The monolithic approach is not compatible with the need for continual 
revision and consistency checking. The modular approach, which advocates the lattice 
and context of theories, is very compatible with these needs. The monolithic-modular 
distinction is important for the maintenance of object-level ontologies (see the discussion 
below). However, in this section we are mainly interested in the prescriptive-descriptive 
distinction, which is important for meta-level ontologies; that is, metatheories. 



4 Compare the prescriptive-semantic distinction discussed in the paper [5] by Dampney and Johnson. 
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All natural languages are inherently dynamic living entities. However, the dynamic 
growth of some languages such as English is greatly aided by the way that it is cataloged. 
In 1755 the great English lexicographer and literary critic Samuel Johnson published 
his Dictionary of the English Language |5J. This work set the standard for all modern 
English dictionaries. However, this standard of lexicography is distinctly different from 
that used in the related French and Italian languages. The linguistic purity of French 
and Italian is to some extent regulated by the Academie Frangaise^ and the Accademia 
della Crusca[§, respectively. These two bodies were established "to prescribe the use of the 
language " US]. As the full titleH of Johnson's work implies, modern English dictionaries 
do not prescribe how the English language should be used, but instead describe how the 
language actually is used. The following quotation from the preface to [5] expands on this 
descriptive attitude. 

11 Every increase of knowledge, whether real or fancied, will produce new words, or combina- 
tion of words. When the mind is unchained from necessity, it will range after convenience; 
when it is left at large in the fields of speculation, it will shift opinions; as any custom is 
disused, the words that expressed it must perish with it; as any opinion grows popular, it 
will innovate speech in the same proportion as it alters practice." 

An ontology is similar to a dictionary, but has greater detail and structure. Both dic- 
tionaries and ontologies come in two basic philosophies: prescriptive or descriptive. A 
descriptive dictionary or ontology describes actual usage. Most modern dictionaries are 
descriptive with the Oxford English Dictionary (OED) as a leading example. As discussed 
below, the Information Flow Framework (IFF) follows a similar descriptive philosophy. 

In this paper we are mainly interested in the analogy between dictionaries and meta- 
ontologies. First, there is a correspondence between the builders of dictionaries and those 
of meta-ontologies. Corresponding to the lexicographers^], such as Samuel Johnson or 
James Murray of the OED, who create dictionaries, are the ontologicians (mathemati- 
cians, particularly category-theorists) who create meta-ontologies. Second, there is a 
correspondence between the source material used by dictionaries and that used by meta- 
ontologies. As discussed in [19] . the entries placed and described in dictionaries have 
three sources: (i) terms borrowed from other dictionaries, (ii) new terms used to express 
concepts in works of literature, and (iii) new terms used to express concepts in everyday 
speech. By analogy, the concepts axiomatized in meta-ontologies originate from three 
sources: (i) terms borrowed from other meta-ontologies, (ii) new metalevel terms used to 
express concepts in object-level ontologies, and (iii) new metalevel terms used to express 
the conceptual structure of a community. For both dictionaries and ontologies, the sec- 
ond source is most important. Lexicographers use works of literature as the main source 
for dictionaries, whereas ontologicians use the meta, generic and abstract terminology of 

5 1635-present [www . academie-f rancaise . f r] 
6 1582-present [www. accademiadellacrusca. it] 

7 A Dictionary of the English Language: In Which the Words are Deduced from Their Originals, and 
Illustrated in Their Different Significations by Examples from the Best Writers [5] 

8 Compare Samuel Johnson's definition [5] of a lexicographer as a "a writer of dictionaries, a harmless 
drudge, that busies himself in tracing the original, and detailing the signification of words" . 
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object-level ontologies as the main source for meta-ontologies. For meta-ontologies, this 
constraining process is known as "conceptual warrant" and is discussed in more detail 
below. Third, there is a correspondence between the source creators used by dictionaries 
and those used by meta-ontologies. Corresponding to the literary figures who originate 
new terms in works of literature are the knowledge engineers who originate and use new 
metalevel terms in object-level ontologies. 

4. The Standard Upper Ontology (SUO) 

The IEEE P 1600.1 Standard Upper Ontology (SUO) project [IB] , which operates under 
the auspices of the IEEE Standards Association (IEEE-SA), aims to specify an upper 
ontology that will provide a structure and a set of general concepts upon which object- 
level ontologies can be constructed. According to the IEEE-SA, "a standard is a published 
document that sets out specifications and procedures designed to ensure that a material, 
product, method, or service meets its purpose and consistently performs to its intended 
use"|^|. It is also understood that such a standard may involve some agreement about the 
conformance to implementations of the standard. As understood by the SUO working 
group, an upper ontology is limited to concepts that are meta, generic, abstract and 
philosophical. It is anticipated that the SUO will be useful in data interoperability, 
information search and retrieval, automated inferencing, and natural language processing. 
The SUO follows Robert's Rules of Order — there are various discussions on the email 
list, after which the chair can possibly call for a vote. The SUO has been active for about 
five years. During this time, it has approved several resolutions. In chronological order, 
these are described as follows. 

2001 August: Information Flow Framework (IFF). The SUO IFF projec10was the first 
proposal passed by the SUO working group. It is a descriptive category metatheory 
that represents the structural aspect of the SUO. The IFF is involved with concepts 
that are meta0, generic and abstract. The philosophy, approach, architecture and 
methods of the IFF are described in this paper (see Sections [IJ [2] and [3] for the 
philosophy, Sections |5] and [6] for the approach, Sections [7J [BJ |9] and [10] for the 
architecture and Section [IT] for the methods). As described in this paper, the IFF 

9 According to the dictionary |13j . a standard has the sense of "something established by authority, 
custom, or general consent as a model or example", "something set up and established by authority as 
a rule for the measure of quantity, weight, extent, value, or quality" , or "a structure built for or serving 
as a base or support" . A standard "applies to any definite rule, principle, or measure established by 
authority" . 

10 [suo . ieee . org/IFF/] 

11 According to the dictionary |13j . the term 'meta-' is a prefix meaning a "more highly organized or 
more comprehensive form of" something, and is "used with the name of a discipline to designate a new 
but related discipline designed to deal critically with the original one" . Examples include: a metatheory 
is a theory whose subject matter is another theory; a metalanguage is a language used to talk about other 
languages; and metamathematics is the field of study concerned with the formal structure and properties 
of mathematical systems. 
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might be called the Standard Meta Ontology (SMO). In its applicational aspect, the 
IFF project uses techniques and concepts from the fields of information flow ([]]), 
formal concept analysis ([3]), category theory ([12]) and institutions Q4J). 

2003 June: Suggested Upper Merged Ontology (SUMO). The SUO SUMO project is 
sponsored and developed by the Teknowledge Corporation^*!. The SUMO proposal 
initially failed its vote, principally due to advocacy of a monolithic philosophy for 
ontology development. About this time, there was a vigorous debate on the SUO 
email list about the monolithic-modular distinction (See Section [11] for a discussion 
about modularity). The modular philosophy for ontology development is advocated 
by the SUO Lattice of Theories project. With passage of this proposal, there is an 
expressed intent that SUO SUMO working group will collaborate with the SUO 
Lattice of Theories working group. 

2003 June: Lattice of Theories (LOT). The SUO Lattice of Theories project intends to 
develop a standard for ontology specification and registration. The standard will 
be based on the contributions of other SUO candidate projects. The standard will 
specify an ontology registry, such as the metadata registries specified by the In- 
ternational Organization for Standardization (ISO) standard ISO/IEC IS 11179-3, 
but with extensions that are needed in order to define and relate ontologies. The 
ontology registry shall be organized as a collection of modules, related in a gener- 
alization/specialization hierarchy. Each module shall consist of a theory together 
with documentation and other metadata. The theory shall consist of axioms and 
definitions stated in a logic-based language, such as those in the Common Logic 
(CL) framework. The standard shall include the specification of a methodology 
for testing the theory part of any module for consistency, relating theories to one 
another in the generalization/specialization hierarchy, and combining two or more 
theories to derive a new theory that is larger and more specialized than the theo- 
ries from which it was derived (See Section [TT] for a discussion and mathematical 
formulation of the "lattice of theories" construction.). 

2003 October: OpenCyc Ontology. The OpenCyc ontology^, which contains about 
5,000 concepts and 50,000 axioms (aka rules), is the open source version of the 
Cyc ontology. The Cyc ontology, which contains about 300,000 concepts and 3 
million axioms, is a large and general knowledge base whose intended use is for 
commonsense reasoning. It has spent over 600 person-years in a development effort 
over the past 17 years. The SUO working group would like to dismantle the OpenCyc 
ontology into meaningful components and reassemble them within the flexible and 
dynamically changeable structure of the "lattice of theories" framework (Again, see 



12 [suo . ieee . org/SUD/SUMD/] 
13 [www. teknowledge . com] 
14 [www . opencyc . org] 
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Section [TT] for a discussion and mathematical formulation of the "lattice of theories" 
construction.). 

2003 October: SUO 4D Ontology. The SUO 4D Ontology project^ intends to develop 
an ontology based on the 4-dimensional paradigm. A starting point for the de- 
velopment of this ontology is contained in the ISO standard ISO/FDIS 15926-2, a 
lifecycle integration of process plant data including oil and gas production facilities. 
The approach will be to develop the ontology as reusable components within the 
institutional aspect of the IFF and to develop mappings to other ontologies within 
this framework. 

2004 May: Multi-Source Ontology (MSO). The SUO MSO is based at the WebKB- 
2 knowledge servejlj, a knowledge server that permits Web users to browse and 
update private knowledge bases on their machines and a large shared knowledge base 
on the server machine. The ontology of the shared knowledge base is currently an 
integration of various top-level ontologies (e.g. Sowa, Dolce, the Lifecycle Integration 
Schema, the Natural Semantic Metalanguage, OWL, DAML+OIL, KIF and the 
Dublin Core) and a lexical ontology derived from an extension and correction of the 
noun-related part of WordNet 1.7. 



5. The IFF Design Guidelines 

The Information Flow Framework (IFF) (Figure |2J) is a descriptive category metatheory 
that is intended to represent the structural aspect of the SUO. In the development of 
the IFF, certain guidelines have proven to be very important. These are all predicated 
on the goal of building a category-theoretic metatheory for object-level ontologies. Ini- 
tially, this metatheory was designed to represent first order logic, its languages, theories, 
model-theoretic structures and (local) logics, including satisfaction and fibrations based at 
languages. Later, this metatheory incorporated the theory of institutions of Goguen and 
Burstall. The most important guideline in the development of the IFF — what we might 
call the meta-guideline — is to follow the intuitions of the working category-theorist. Such 
intuitions represent naive category theory^l. In practice, we initially formulate any IFF 
module as a set-theoretic axiomatization using a first order expression. Then, by elimi- 
nating quantifiers and logical connectives, we attempt to move, morph or transform this 
set-theoretic axiomatization toward a category-theoretic axiomatization. A convenient 
rule-of-thumb is to "keep it simple". From the foundational standpoint, this means that 
we start with no assumptions at all. However, in view of the Cantor diagonal argument, 
as a first step we assume a slender hierarchy called the IFF metastack. 

15 [suo . ieee . org/SU0/SU0-4D/] 
16 [www . webkb . org] 

17 The meaning of naive here is not pejorative. It means |13j primitive, natural, intuitive, first-formed, 
primary, not evolved or elemental. 
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Figure 2: The IFF Architecture (iconic version) 



The IFF metastack, which is the core hierarchy of the IFF, is represented as the cen- 
tral axis or spindle in Figure [2j The content of the metastack is the axiomatization for 
sets and functions (and for convenience, binary relations) in four core modules at four 
different set-theoretic levels, the "small" in the lower metalevel, the "large" in the upper 
metalevel, the "very large" in the top metalevel and the "generic" in the ur metalevel. 
Currently, the axiomatization for the metastack has a "conical" shaped, with an axiom- 
atization for categories introduced in the ur metalevel, an axiomatization for finite limits 
introduced in the top metalevel, an axiomatization for exponents and finite colimits in- 
troduced in the upper metalevel, and an axiomatization for subobject classifiers^] and 
general limits/colimits introduced in the lower metaleveQ This effectively distributes 
topos axioms over the four metalevels. The categories Set C Cls C Col C Ur in the IFF 
metastack, which correspond to the four metalevels mentioned above, are axiomatized 
in the lower core (IFF-LCO), the upper core (IFF-UCO), the top core (IFF-TCO) and the 
ur (IFF-UR) meta-ontologies, respectively. Here, all inclusions preserve composition and 
identities (are functorial), the first two preserve finite limits, and the first preserves the 



18 The conical shape of this axiomatization seems natural so far, even with the axiomatization for 
2-categories. However, with the proper placement of the axiomatization for institutions in the upper 
metalevel, any strong appeal for application of the categorical design principle to this institutional ax- 
iomatization would call for exponents in the top metalevel. This would give the metastack axiomatization 
more of a "cylindrical" shape. 

19 This axiomatization has not been realized so far, since it is not needed, and hence, at this time would 
violate the principle of conceptual warrant (see below). However, we anticipate its need. 

20 Assuming a conical shape, an abstract metastack M = (M4 3 M3 D M2 3 Mi) is define to be a 
category M4 at the 4 th metalevel (Ur) restricted to a finitely complete category M3 at the 3 rd metalevel 
(Top) restricted to a cartesian closed category M2 at the 2 nd metalevel (Upper) restricted to a topos Mi 
at the 1 st metalevel (Lower). 



10 



C'k 



subset 



fk+1 

Ak+i *■ -Bfc+i 



Tk+l 



Ak+i" Rk 



'fc+i 



+i- 



Bk+i 



commutative 
diagram 



multipi 



diagi am 



A k 



Ai 



Rk 



Bk 



fk 

restriction 



abridgment 



Figure 3: The three fundamental generic relations 



Cartesian closed structural. 

These preservation properties represent the fact that the axiomatization at each met- 
alevel includes the specialization of the axiomatization at the next higher metalevel. 
Specialization is a technical term here, which means the exact adaptation of the core 
terminology and axiomatization at one metalevel to the next lower metalevel by use of 
the three fundamental generic relations of subset, (function) restriction and (binary rela- 
tion) abridgment that are diagrammatically described in Figure El where k is a particular 
metalevel, k+1 is the next higher metalevel, the set Ck is a subset of the set Ck+i, the 
function fk is a restriction of the function fk+i, and the relation r k with extent Rk is an 
abridgment of the relation rk+i with extent Rk+i- Table [1] gives a brief glance at the IFF 
code that represents specialization of the core set notions. 

Another guideline is that quantification should be over a specific collection. This ca- 
pability is realized in the IFF metashell (see Section [10] on metalanguages), and hence 
all metalanguages, by the abbreviation called restricted quantification. A third guideline 
in the IFF development, is to use functions wherever possible. This encourages the elim- 
ination of quantifiers. A final guideline is that as a general rule, universes are kept away 
from power operators (and hence also exponent operators). This is part of the injunction 
to avoid contradictions (paradoxes) due to Cantor (and Russell etc.). In practice, we 
have little or no need for universes — we use the term "thing" in the Ur and Top core 
ontologies, but this has minimal use. 



6. The IFF Development Phases 

The IFF has gone through two phases of development. The IFF Category Theory (meta) 
Ontology (IFF-CAT) is distinguished by being the first meta-ontology axiomatized during 

21 Assuming a conical shape, a morphism of abstract metastacks H : M — > M' consists of a functor 
(morphism of categories) H4 : M4 — > M4 at the 4 th metalevel, restricted to a morphism of finitely complete 
categories H3 : M3 — > M3 at the 3 rd metalevel, restricted to a morphism of cartesian closed categories 
H2:M2— >M2at the 2 nd metalevel, restricted to a (geometric) morphism of toposes Hi:Mi— »Miat 
the 1 st metalevel. 
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IFF-UR: "An Ur-object represents the notion of a generic set." 

(forall (?x (object ?x)) 
(thing ?x)) 

IFF-TCO: "A collection represents the notion of a very large set. There is an 
Ur-object of all collections. Any collection is a UR-object. The Ur-object of 
all collections is not itself a collection." 

(ur: object collection) 
(forall (?c (collection ?c)) 

(ur: object ?c)) 
(not (collection collection)) 



IFF-UCO: "A class represents the notion of a large set. There is a collection 
of all classes. Classes are mainly used in IFF to specify the object and raor- 
phism collections of large categories such as the category of all small sets and 
functions Set and the category of all small classifications and infomorphisms 
Clsn. Every class is a collection. The collection of all classes is not itself a 
class." 

(vlrg. set : collection class) 

(ur : subobject class vlrg . set : collection) 

(not (class class)) 



Table 1: Code for the IFF Metastack 

the first phase of development. It indicated necessary ingredients in the core axiomatiza- 
tion now known as the metastack. The IFF-CAT has axiomatizations for large categories, 
large functors, large natural transformations, large adjunctions, large monads, and lim- 
its/colimits. Perhaps IFF-CAT is in some sense an entry-level axiomatization for the cate- 
gory of all categories. A non-starter for the remaining part of the first phase was a topos 
axiomatization. This received objections from the SUO working group, in part due to its 
lack of support by motivating examples. Rejection of the topos axiomatization prompted 
the idea of conceptual warrant. The second phase of the IFF was designed bottom-up 
and followed conceptual warrant as its guideline. The most important metalogic termi- 
nology incorporated in the second phase references the various concepts related to finite 
limits. For example, the desire to axiomatize composition of class functions in the upper 
metalevel requires the pullback concept at the top metalevel. The IFF is now well within 
its third phase of development, which involves conversion of the truth construction to the 
theory of institutions and reorganization of the metastack using the "adjunctive axioma- 
tization" technique illustrated in Lawvere and Rosebrugh [10J. A major goal for the third 
phase is to complete the categorical design principle at the lower metalevel and to initiate 
it at the upper metalevel. In order to accomplish this, we have extended composition 
to all levels of the metastack and we have introduced the important exponent metalogic 
concept in the upper core. A fourth phase is envisioned in the future, when the concepts 
of fibrations and indexed categories will be axiomatized. These are important for the 



12 



axiomatizations of institutions and fibring logics. 

During the IFF development, four concepts have eventually emerged as important. 
One of these concepts, the IFF metastack, which originated in phase three of IFF develop- 
ment, was discussed above. The other three concepts are principles for IFF development. 
In chronological order these are (1) the principle of conceptual warrant, (2) the principle 
of categorical design and (3) the principle of institutional logic. Conceptual warrant re- 
stricts the introduction of upper metalevel terminology, whereas categorical design forces 
the introduction of this terminology, principally in the core. 

Principle: conceptual warrant. All IFF terminology should require con- 
ceptual warrant for their existence: any term that appears in (and is ax- 
iomatized by) a metalanguage should reference a concept needed in a lower 
metalevel or object level axiomatization. 

The principle of conceptual warrant originated in phase one of IFF development. The ter- 
minology appearing in any standardization meta-ontology will exert authority. Because 
of this, in selecting which terminology to specify in the IFF, we utilize the notion of "con- 
ceptual warrant". Warrant means evidence for, or token of, authorization. Conceptual 
warrant is an adaptation of the librarianship notion of literary warrant. According to the 
Library of Congress, its collections serve as the literary warrant (i.e., the literature on 
which the controlled vocabulary is based) for the Library of Congress subject headings 
system. In the same fashion, the object-level and lower metalevel terminology of the IFF 
serves as the conceptual warrant for the IFF upper metalevel axiomatization. 

Principle: categorical design. The design of a module at any particular 
metalevel should adhere to the property that its axiomatic representation is 
strictly category-theoretic: All axioms use terms from the metalanguage at 
that metalevel. No axioms use explicit logical notation: No variables, quan- 
tification ('V", '3') or logical connectives ('A', 'V, '=>■','-<=>■') are used. 

The principle of categorical design originated in phase two of IFF development. The goal 
of this principle has been to simplify the IFF axiomatization — first order expression 
would be reduced to term-rewriting. The peripheral (non-core) modules in the lower IFF 
metalevel have the tripartite form: outer category namespace, inner object and morphism 
namespaces (see Section [9] for more discussion on this and Figure [7] for a visualization). 
The outer namespace fully conforms to the categorical design principle. The inner names- 
paces conform to it to a great extent (80-90%). The categorical design principle was 
originally expressed for the lower metalevel ("the small"), but with composition extended 
upward this principle also seems appropriate for the upper metalevel ("the large"); in 
particular, it seems appropriate for axiomatizing the IFF Category Theory (meta) Ontol- 
ogy (IFF-CAT). For example, Table [2] contains code for the concept of a (large) category, 
which fully conforms to the categorical design principle. 
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IFF-CAT: "A (large) category can be thought of as a special kind of graph 
— a graph with monoidal properties. It consists of an underlying graph, a 
composition graph morphism and an identity graph morphism, both with an 
identity object function." 

(vlrg. set collection category) 
(vlrg.ftn: function graph) 

(vlrg. ftn: function underlying) (= underlying graph) 

(= (vlrg. ftn: source graph) category) 

(= (vlrg. ftn: target graph) lrg.gph.obj : graph) 

(vlrg. ftn: function graph-pair) 

(= (vlrg. ftn: source graph-pair) category) 

(= (vlrg. ftn: target graph-pair) lrg.gph.obj :multipliable-pair) 

(= graph-pair (vlrg. lim.pbk. obj : pairing [graph graph])) 

(vlrg. ftn: function mu) 

(= (vlrg. ftn: source mu) category) 

(= (vlrg. ftn: target mu) lrg.gph.mor:2-cell) 

(= (vlrg. ftn: composition [mu lrg.gph.mor: source] ) 

(vlrg. ftn: composition [graph-pair lrg. gph. obj multiplication] ) ) 
(= (vlrg. ftn composition [mu lrg.gph.mor :target] ) graph) 
(vlrg. ftn: function eta) 
(= (vlrg. ftn: source eta) category) 
(= (vlrg. ftn: target eta) lrg.gph.mor :2-cell) 
(= (vlrg. ftn: composition [eta lrg.gph.mor : source] ) 

(vlrg. ftn: composition (vlrg. ftn: composition 

[graph lrg. gph. obj : object] ) lrg.gph.obj : unit] )) 
(= (vlrg. ftn composition [eta lrg.gph.mor :target] ) graph) 



Table 2: Code for (large) categories 
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Principle: institutional logical. All logics used in the IFF application (see 
Section [11]) should be formulated as institutions. 

The principle of institutional logic originated in phase three of IFF development. The 
theories of information flow and formal concept analysis (and hence, effectively the theory 
of institutions) have been used throughout the IFF development. This use has centered 
on the "truth construction". The truth construction is institutional [9], consisting of 
a classification functor dsn : Sign — > CLSN or a concept lattice functor clg : Sign — > 
CLG, where Sign is the category of signatures, and CLSN and CLG are the equivalent 
categories [7] of (large) classifications and (large) concept lattices, respectively. The truth 
classification was discussed in Barwise and Seligman [ljU In the IFF development, the 
truth concept lattice, which was defined to be the concept lattice of the truth classification, 
its equivalent intensional aspect of closed theories, or its equivalent construction of theories 
under entailment order, was promoted by the IFF as the proper representation of the 
"lattice of theories" notion advocated by the SUO working group. As noted in [13], truth 
is not dyadic between models and sentences, but triadic between models, sentences and 
signatures. This corresponds to the contextual dependency in the semiotics of Charles 
Sanders Peirce, which uses signs, objects and interpretants. In formal concept analysis 
triadic concept lattices [Uj have been used to formalize this. However, the representation 
of contexts as the logics of institutions has greater advantages. 



7. The IFF Architecture 

The IFF architecture (Figure H]) consists of metalevels, namespaces and meta-ontologies. 
Within each level, the terminology is partitioned into namespaces. The number of names- 
paces and the content may vary over time: new namespaces may be created or old names- 
paces may be deprecated, and new terminology and axiomatization within any particular 
namespace may change (new versions). In addition, within each level, various names- 
paces are collected together into meaningful composites called meta-ontologies. At any 
particular metalevel, these meta-ontologies cover all the namespaces at that level, but 
they may overlap. The number of meta-ontologies and the content of any meta-ontology 
may vary over time: new meta-ontologies may be created or old meta-ontologies may be 
deprecated, and new namespaces within any particular meta-ontology may change (new 
versions). Table [3] presents a list of IFF meta-ontologies organized by metalevel. All URLs 
in this table should be prefixed with the string 'http://suo.ieee.org/iFF/'. 

The IFF terminology is managed in terms of namespace prefixes — each namespace is 
given a unique prefix (with perhaps a few synonyms) in order to avoid clash of terminol- 
og}0 The architecture of the IFF namespace mechanism is flat — namespace prefixes 

22 Suggested to the author by Joseph Goguen (personal communication). 
23 See example 4.6 on page 71. 

24 For example, the term 'morphism' is introduced in the contexts of models and theories for first order 
logic to represent two distinct, but analogous, concepts. When these concepts need to be used in other 
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Ur: IFF-UR [30 terms, 7 pages] 
"generic" (category axioms) 
metastack/UR.pdf 

IFF-TCO [300+ terms, 60 pages] 
"very large" (collections; finite limit axioms) 
metastack/TCQ . pdf 
IFF-2CAT (basic 2-catcgory theory) 

metalevel/top/ontology/2-category/version20041004 . pdf 

Upper: IFF-UCO [500+ terms, 100+ pages] 

"large" (classes; exponents, finite limits and colimits) 
metastack/UCO . pdf 

metalevel/upper/ontology/core/version20020102 . pdf 
IFF-CAT [220+ terms, 53 pages] 
(basic category theory) 

metalevel/upper/ontology/category-theory/version20020102 .pdf 
IFF-CLSN [280+ terms, 78 pages] 

(basic information flow and formal concept analysis) 
metalevel/upper/ontology/classif ication/version20020102 .pdf 
IFF-INS 

(rudimentary institution theory) 
work-in-progress/INS/vers ion20031002.pdf 
work-in-progress/INS/vers ion20041014.pdf 

Lower: IFF-LCO [many, many terms, 150+ pages] 

"the small" (sets; quartets; arbitrary limits and colimits) 
metalevel/lower/ontology/core/version20020515 . pdf 
metalevel/lower/namespace/set/version20030402 . pdf 
IFF-CL [ 100 terms, 36 pages] 
(common logic) 

metalevel/lower/namespace/scl/version20040505-obj . pdf 

IFF-ONT [600+ terms, 194 pages, Language, Theory, Model, Logic namespaces] 

(logic and ontology) (nontraditional FOL) (old version) 

metalevel/lower/ontology/ontology/version20021205 .htm 

IFF-00 

(logic and ontology) (nontraditional FOL) (new version) 

work-in-progress/#IFF-00 

IFF-FOL [many, many terms, 150+ pages] 

(logic and ontology) (traditional FOL) 

metalevel/lower/ontology/f ol/version20040101 . html 



Top: 



Table 3: IFF (meta) Ontologies 
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Figure 4: The IFF Architecture (detailed version) 



are like tags: by using namespace prefixes the complete IFF terminology is the disjoint 
union of the terminology in the individual IFF namespaces. The IFF architecture can 
be thought of as a two dimensional structure (Figure Hj) consisting of metalevels, which 
are partitioned into top-level^ namespaces representing basic concepts such as category 
('cat'), graph ('gph'), or institution ('ins'). The various levels are indexed by the natural 
numbers '0', 'l', '2', '3', '4', '5', . . . , or for the first five levels by their natural language 
correlates 'obj', 'sml', 'lrg', 'vlrg' and 'ur', starting with the object level indexed by '0'. 

Overall, various namespaces may have the same name, since they represent the same 
basic concept at different metalevels. For each basic concept the namespace axiomati- 
zation at a particular metalevel is in two parts, one generic and the other specific: the 
generic part is the specialization of the axiomatization just above it in the hierarchy; 
these axiomatizations rely heavily upon the subcollection, restriction and abridgment re- 
lations for sets, functions and binary relations, respectively (Figure |3]); the specific part 
is an extension of the basic concept; it is a strictly new axiomatization. To locate any 
namespace one can use its level-concept pair. For example, the namespace axiomatizing 
very large categories would be denoted by 'vlrg. cat' or '3. cat'. Sub-namespaces (foot- 
note [25]) (not at top-level) will need further qualification. For example, the sub-namespace 
of small graph morphisms would be denoted by 'sml.gph.mor'. It is assumed that each 
basic concept has a particular metalevel that is in common use. These are underlined for 
the particular concepts in Figure HI For such common use namespaces, the level notation 
need not be used. For example, the namespace that axiomatizes large categories would 
be denoted by 'lrg. cat' or '2. cat', but more simply by 'cat'. 

The namespace mechanism has been made backward compatible, by allowing special 
namespace prefix notation that is equivalent to the general format just described. For 
example, the namespace axiomatizing large categories would be denoted by the general 



contexts, the namespace prefixes 'f ol .mod.mor' and 'fol.th.mor' could be used to distinguish them, 
thus resulting in the distinct representations 'f ol .mod .mor :morphism' and 'fol.th.mor:morphism'. 

25 This does not refer to the vertical dimension of the metalevel structure in Figure [4] but instead to 
an implicit third dimension of the architecture. 
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Figure 5: Core Architecture (iconic version) 
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Figure 6: Core Architecture (detailed version) 



prefixes 'cat', 'lrg.cat' or '2. cat' or by the special prefix 'CAT' that was used in earlier 
versions of the IFF-CAT axiomatization. Hence, for any basic concept, both the general 
prefix and the common level need to be declared, and for any namespace axiomatization 
the special prefixes also need to be declared. The prefixes denoting sub-namespaces 
should be compatible with the top-level prefixes; for example, in the upper metalevel, the 
general prefix for the namespace of classes would be 'lrg.set' or '2. set' with the special 
prefix 'SET', whereas the general prefix for the sub-namespace of class pullbacks would 
be 'lrg.set.lim.pbk' with the special prefix 'SET.LIM.PBK'. As these examples illustrate, 
only the lower case is needed for the general namespace prefix notation. 
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Set 




runcLion 


r\eid Lion 


'thing' 






Obj = 


'object' 




< = 'subobject ' 
_L = 'disjoint ' 
— = isomorphic' 
|_ = 'restriction' 
<C] = 'abridgment' 


Mor = 


'morphism' 


On, d\ = 'source', 'target' 
p = 'mor2rel' 


'restriction' 
Mono = 'monomorphism' 


Mor X obj Mor = 


'morphism 


MO; Ml = 'morphismO ' , 'morphisml ' 


Epi = 'epimorphism' 




-morphism' 


o, 1 = 'composition', 'identity' 


Iso = 'isomorphism' 


Rel = 


'relation' 


on, oi = 'objectO', 'object!' 

e = 'extent' 
7i"0 5 7Ti = 'projectionO' , 'projection!.' 


'abridgment ' 


Sub = 


'subordinate ' 


A, 7 = 'lesser', 'greater' 
t, 6 = 'inclusion' , 'reflex' 





Table 4: IFF-UR correspondences 

8. The IFF Metastack 

The left sides of Figures [5] and [6] illustrate the architecture for the IFF Ur (meta) Ontology 
IFF-UR, a tiny ontology at the very top of the metastack, the IFF core metalevel hierar- 
chy. Its purpose is to provide an interface between the IFF logical metashell and other 
metalevel axiomatizations. Principally, it does this by servicing the top metalevel. A 
design constraint is that "things are opaque" ; that is, the detailed state of affairs "inside" 
anything is unknown. Such detail is to be provided by a using ontology, such as the IFF 
Top Core (meta) Ontology (IFF-TCO), the IFF 2-Category (meta) Ontology (IFF-2CAT), 
and the future IFF Double-Category (meta) Ontology (IFF-DBLCAT). The IFF-UR contri- 
bution to the meta-ur metalanguage is a simple framework — the IFF-TCO, IFF-2CAT and 
IFF-DCAT fill it out. Table H] shows various correspondences between the elements on the 
left side of Figure E] and the terminology of IFF-UR (the IFF-UR-term 'thing' is not repre- 
sented in Figure E])- The IFF-UR terminology consists of only 30 terms: 6 generic sets; 16 
generic functions; and 8 generic relations, consisting of 3 binary endorelation on objects, 
1 binary and 3 unary relations on morphisms, and 1 binary relation on relations. The 
subobject, restriction and abridgment binary relations are important for the preservation 
properties of the metastack. 

The middle of Figure [5] and the right side of Figure [6] illustrate the architecture for 
the IFF Top Core (meta) Ontology IFF-TCO, which is situated in the top metalevel — the 
highest level of the IFF metastack other than Ur. It contains rudimentary (fundamental) 
namespaces for collections, functions, relations and finite limits. The IFF-TCO provides an 
adequate foundation for representing ontologies in general and for defining other metalevel 
ontologies in particular. The fact that the IFF-TCO includes the specialization of the 
IFF-UR is evident from the isomorphic embedding of the left side into the right side of 
Figure El The IFF-UR follows Mac Lane's beginning axiomatization for category theory 
[T2] in that it introduces terminology and provides an axiomatization for this terminology, 
but it does not give a formal interpretation using set theory — it only gives an informal, 
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intuitive interpretation. The IFF-TCO initiates such a formal interpretation. The IFF 
Upper and Lower Core (meta) Ontologies, IFF-UCO and IFF-LCO, complete such a formal 
interpretation. 

The IFF-TCO axiomatization contains 317 terms (276 concepts or non-identical terms), 
partitioned according to whether the term is a basic term (collections, relations or func- 
tions), a diagram term or a limit term. Ignoring the 10 terms used to designate the 
four indexing collections, there are a total of 307 terms partitioned into 58 basic terms 
designating 55 basic concepts dealing with collections, (partial) functions and relations, 
105 terms designating 97 concepts of finite diagrams, and 144 terms designating 120 con- 
cepts of finite limits proper. Although the IFF-TCO axiomatizates finite limits, the current 
version does not represent finiteness, but instead explicitly represents the several finite 
shapes needed. A future version of the IFF-TCO may experiment with Dedekind's abstract 
definition of finiteness [TO]. Terminology has been placed in the IFF Top Core Ontology 
only when it is needed in the IFF upper metalevel. All upper metalevel ontologies import 
and use, either directly or indirectly, the IFF Top Core Ontology. This includes the upper 
core, upper graph and upper category theory meta-ontologies. 

In the third phase, a revised version of the IFF Upper Core (meta) Ontology (IFF-UCO) 
will be designed. The right side of Figure |5] illustrates the architecture for the IFF-TCO. 
This axiomatization will be in adjunctive form. The IFF-UCO plays a central role in 
the IFF axiomatization - - it is the most referenced ontology at the metalevel. The 
IFF-UCO kernel consists of the axiomatization for classes and functions. At the upper 
metalevel the axiomatization for relations, which includes order-theoretic concepts, has 
branched off into an ontology in its own right. The kernel and finite limits namespace 
axiomatizations are the specializations of those in the IFF-TCO, and as such rely heavily 
upon the subcollection, restriction and abridgment relations for collections, functions and 
binary relations, respectively (Figure |3]). The finite colimits namespace is a categorical 
dual to the finite limits namespace. The exponents namespace axiomatization is new. 

9. The IFF Lower Metalevel 

Each lower-level meta-ontology in the IFF is concentrated around a single category (with 
its associated functors, natural transformations and monads), and hence the format of 
each lower-level meta-ontology has a specific category-theoretic format consisting of an 
"outer" category namespace and two "inner" namespaces for objects and morphisms. In 
Figure [7] we illustrate this format with the category Trm-Lang of term languages and term 
language morphisms^), which is axiomatized in the term language namespace trm.lang 

26 A term language is simple — it consists of a set of (indexing) variables, a set of function types or 
symbols, and an arity function from function symbols to indicia (subsets of variables). A term language 
morphism is also simple — it consists of a variable bijection and a function between function symbol 
sets, which form a set quartet between source/target arity functions (preserves arity). We could have 
ignored variables, indicia and arity for languages, and just used natural numbers and valence instead, 
with language morphisms required to preserve valence. In fact, the classic Lawvere category does this — 
it uses natural numbers as objects, in place of indicia, and uses term sequences in place of term tuples. 



20 



trm. lang 

category namespace 

trm. lang. obj 
object namespace 



trm. lang. mor 
morphism namespace 



Figure 7: Lower Core Format 

of the IFF Term (meta) Ontology IFF-TRM. The category namespace of trm. lang is com- 
pletely compliant with the categorical-design principle. This is illustrated in Table 
which lists the logical code for the category Trm-Lang, the functor ftn : Trm-Lang — > Set, 
the arity natural transformation # : ftn =>- var o p : Trm-Lang — > Set, and the Lawvere 
construction law : Trm-Lang — > Ca@ The "outer" category namespace of trm . lang uses 
terminology from the meta-upper metalanguage, plus the lower metalevel terminology 
from the object and morphism namespaces of trm. lang. The latter are only partially 
(80-90%) compliant with the categorical-design principle. Part of the goal for phase 3 is 
to bring these "inner" namespaces toward 100% compliance. 

The basic architecture of term languages is illustrated in Figure [BJ which consists of 
three sub-diagrams — the term sub-diagram (upper left), the term tuple sub-diagram 
(lower left) and the Lawvere sub-diagram (right). The term sub-diagram illustrates the 
fixpoint solution for terms, the term tuple sub-diagram illustrates the embedding struc- 
tures for term tuples, and the Lawvere sub-diagram illustrates the Lawvere category 
law(L) of a term language L, which has indicia as objects, term tuples as morphisms, and 
substitution as composition. The Lawvere construction is a collection of (small) categories 
and functors indexed by term languages and term language morphisms. Abstractly, the 
Lawvere construction is a (small) category object in the (large) category of functors and 
natural transformations between the categories Trm-Lang and Set. In Figure El the nodes 
represent basic functors from Trm-Lang to Set, and the edges represent natural transfor- 
mations between these basic functors. There are five simple functors var, ftn, case, trm 
and tpl, for variables, function symbols, variable cases, terms and term tuples, respec- 

However, we used variables for at least four reasons. (1) Standards such as the common logic standard 
(CL) use a set of variables as part of their lexicons. (2) We did not want to rely upon sequences and a 
natural numbers axiomatization. (3) The use of variable-type pairs for arguments in function declaration 
is common in programming languages. (4) We want the expression operator, which requires variables and 
arities as part of the formulation of expressions, to be a monad. It is very easy to add natural numbers, 
valence and term sequences, thus presenting traditional first order logic. 

27 The functor ftn and the natural transformation # are explicitly represented in Figure [U whereas 
Trm-Lang is the ambient category, and law is represented by the Lawvere sub-diagram. 
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trm.lang: "There is a (large) category Trm-Lang of term languages. The 
object class of Trm-Lang is the class of all term languages. The morphism class 
of Trm-Lang is the class of all term language morphisms. Composition in Trm- 
Lang is composition of term language morphisms." 

(cat : category language) 

(= (cat:object language) trm. lang. obj : object) 

(= (cat :morphism language) trm. lang. mor: morphism) 

(= (cat: source language) trm. lang. mor : source) 

(= (cat:target language) trm. lang. mor :target) 

(= (cat : composable language) trm. lang. mor : composable) 

(= (cat : composition language) trm. lang. mor : composition) 

(= (cat : identity language) trm. lang. mor : identity) 



trm.lang: "There is a function symbol functor from the (large) category of 
term languages to the (large) category of sets ftn : Trm-Lang — > Set. There is 
an arity natural transformation # : ftn => var o p : Trm-Lang — > Set, whose L th 
component for any term language L is the arity function for L ." 

(f unc : functor function) 

(= (func: source function) language) 

(= (func: target function) set: set) 

(= (func:object function) trm. lang. obj : function) 

(= (func : morphism function) trm. lang. mor : function) 

(nat : natural-transformation f unction-arity) 

(= (nat : source-category f unction-arity) language) 

(= (nat : target-category f unction-arity) set: set) 

(= (nat : source-functor f unction-arity) function) 

(= (nat : target-functor f unction-arity) indicia) 

(= (nat : component f unction-arity) trm. lang. obj : f unction-arity) 



trm . lang: "The Lawvere construction is a functor law : Trm-Lang — > Cat from 
the (large) category of term languages to the (large) category of small categories: 
for any term language L, law(L) is a small category with coproducts and for any 
term language morphism f : Li — > L2, law(/) : law(Li) — > law(Z/2) is a functor 
between small categories that preserves coproducts." 

(func: functor lawvere) 

(= (func: source lawvere) language) 

(= (func: target lawvere) sml . cat : category) 

(= (func:object lawvere) trm. lang. obj .tpl : lawvere) 

(= (func: morphism lawvere) trm. lang. mor .tpl : lawvere) 



Table 5: Code for trm-lang 



22 



Term 



Lawvere 



i elem subst i 

icase =4> trm 4= ftn <g) tpl! 



4 



e® 1 



tpl 



Tuple 



subst 

trm^=trm (g> tpl 



7T0 7T1 

pr°^ var<^=var x trm=Hrm « 
case {-}JL #JL ftn 

indicvarop <^ tp] varopi # 

: § # ! 




Figure 8: Term Language Architecture: 
Set-valued functors and natural transformations 
on the category Trim- Lang 



e = function as term embedding 
# = function/term/tuple arity 
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tively. The first three are basic and the last two are inductively defined. Based on these, 
there are five composite functors: varop, varxtrm, ftn <g> tpl, trm £g> tpl, tpl £g> tpl, for in- 
dicia (variable subsets), indexed terms (variable-term pairs), substitutable function-tuple 
pairs, substitutable term-tuple pairs and composable tuple-tuple pairs, respectively The 
(g) symbol refers to a matched Cartesian product — the arity of the first (function, term 
or tuple) matches the index of the second (tuple). 



10. The IFF Metalanguages 

Table M lists selected examples of IFF terminology organized along metalevels. In gen- 
eral, the terminology introduced at any level uses the terminology in the same or higher 
metalevels. Object level ontologies use lower metalevel terminology and functionality. 
For example, the terminology and functionality introduced and axiomatized in the lower 
metalevel IFF theory namespace (trm), which is part of the IFF First Order Logic (meta) 
Ontology (IFF-FOL), could be used to organize and axiomatize an E-commerce ontology in 
the object level. Lower metalevel namespaces and meta-ontologies use upper level termi- 
nology and functionality. For example, the terminology and functionality introduced and 
axiomatized in the upper metalevel IFF Category Theory (meta) Ontology (IFF-CAT) is 
used to organize and axiomatize the lower metalevel IFF language namespace (lang) that 
is part of the IFF-FOL meta-ontology. Upper metalevel namespaces and meta-ontologies 
use top metalevel terminology and functionality. For example, the terminology and func- 
tionality introduced and axiomatized in the top metalevel IFF Top Core (meta) Ontology 
(IFF-TCO) is used to organize and axiomatize the IFF-CAT meta-ontology. 

There are thousands of terms in the IFF. Terms are divided into two classes, which 
we can call "usable terms" and "supporting terms". An IFF term, which is defined in 
a particular namespace on a particular metalevel, is a usable term when it is used by at 
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Core 



Periphery 



metashell 
meta-ur 



meta-top 



meta-upper 



meta-lower 



object level 



'and', 'or', 'implies', 'iff, 'forall', 'not', 'forall', 'exists' 


'object', 'morphism' , 'relation', 
'subobject', 'restriction', 'abridgment', 
'composition', 'identity' 




'collection', ' subcollection' , 'function', 
'composition', 'identity', 'restriction', 
'relation', 'abridgment' 




'collection-pair' , 'parallel-pair' , 
'opspan', 'binary-product', 'equalizer', 
'pullback' 


'class', 'function', 'relation', 
'subclass', 'restriction', 'abridgment', 
'composition', 'identity' 


'category', 'functor', 

'natural-transformation' , 'adjunction' , 
'monad', 'graph', 'partial-order', 
'total-order' , 'equivalence-relation' , 
'classification' , ' inf omorphism' , 
'concept', 'concept-lattice', 'bond' 


'unary-function' , 'binary-function' , 
'curry', 'horn', 'exponent' 


'set', 'function', 'relation', 'subset', 
'restriction', 'abridgment', 'composition', 
' identity' 


'entity', 'relation', 'arity', 'language', 
'theory', 'logic', 'language-morphism' , 
'theory-morphism' , ' logic-morphism' 


'diagram', 'colimit', 'direct-flow', 
' inverse-flow' 


'person', 'company', 'employ', 'salary', 'date', ' real -number' , ... 



Table 6: Example IFF Terms 



'abridgment ' 


' ant i symmetr i c ' 


'bijection' 


'binary coproduct ' 


'binary intersection' 


'binary product ' 


'binary union' 


'bond' 


'bonding pair' 


'categorical equivalence' 


'category' 


'class' 


'classification' 


'cocone' 


'coequalizer ' 


'colimit ' 


'colimit injection' 


'collection' 


'composition' 


'concept lattice' 


'cone ' 


'constant function' 


'coproduct ' 


'currying' 


'diagram' 


'disjoint' 


'domain' 


'empty' 


'epimorphism' 


'equalizer' 


'equivalence relation' 


'evaluation' 


' exponent ' 


'function' 


'functional relation' 


'functor' 


'horn' 


'hypergraph' 


'identity' 


'inclusion' 


' inf omorphi sm ' 


'initial ' 


'injection' 


'institution' 


'institution morphism' 


'inverse currying' 


'isomorphic ' 


'isomorphism' 


'language ' 


'limit ' 


'limit projection' 


'mediator' 


'monad ' 


'monomorphism' 


l mor2rel ' 


'morphism' 


'natural transformation' 


'nothing' 


'null' 


'object' 


'one ' 


'opspan' 


'order' 


'packing' 


'pair' 


'parallel pair' 


'partial function' 


'partial order' 


'pfn2ftn' 


'pf n2rel ' 


'product ' 


'product with' 


'pullback' 


'pushout ' 


'reflexive ' 


'relation' 


'restriction' 


'set' 


'signature' 


'source' 


'span' 


' spangr aph ' 


'subclass' 


'subcollection' 


'subordination' 


'subset' 


'surjection' 


'symmetric ' 


'target ' 


'terminal ' 


'ternary coproduct' 


'ternary product ' 


'thing' 


'three' 


'total order' 


'total relation' 


'transitive ' 


'triple' 


'two ' 


'unique function' 


'unique morphism' 


'unit' 


'unpacking' 


'vocabulary' 


'zero ' 









Table 7: Basic IFF Terms 
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vlrg.set: "Two (very large) sets (aka collections) are isomorphic when 
they are connected by a (very large) bisection? 

(forall (?c0 (collection ?c0) ?cl (collection ?cl)) 
(and (iff (isomorphic ?c0 ?cl) 

(exists (?f (vlrg.ftn:bijection ?f)) 

(and (= (source ?f) ?c0) (= (target ?f) ?cl)))) 



Table 8: metashell code 

least one other term in another namespace on that metalevel or on a level below that 
one. An IFF term, which is defined in a particular namespace on a particular metalevel, 
is a supporting term when it is used by another term in that same namespace. Because 
of conceptual warrant, all IFF terms should be usable or supporting, and perhaps both. 
Hence, all IFF terms are necessary, but most IFF terms are "conceptually derived". This 
means that they are a conceptual composite of two or more basic IFF terms. An IFF 
term is a basic IFF term when it is not the conceptual composite of two or more other 
IFF terms. Table [7] lists some basic IFF terms. 

There are four IFF metalevels: lower, upper, top and ur. Each metalevel services the 
levels below: the ur metalevel services the top metalevel, the ur and top metalevels service 
the upper metalevel, the ur, top and upper metalevels service the lower metalevel, and the 
ur, top, upper and lower metalevels service the object-level. There is one metalanguage 
based at each metalevel. Any metalanguage can be used by the meta-ontologies and 
ontologies at all lower levels. These metalanguages can be defined in terms of their top- 
down nesting. Each metalevel k has an associated metalanguage meta(fc), which is the 
union of "old" terminology, "specialization" terminology and "new" terminology. The 
old terminology is the terminology of the metalanguage meta(£;+l) associated with the 
metalevel k+1 immediately above, the specialization terminology is the terminology of the 
specialization of the k+1 axiomatization to the k th metalevel, and the new terminology is 
from the non-specialization axiomatization in the various meta-ontologies at metalevel k. 

Proceeding in a top-down fashion, the metalanguage hierarchy starts with a logical 
shell called the metashell. The metashell enables a lisp-like first-order expression using 
connectives and restricted quantification. Table M contains an example expression from 
the collection namespace of the IFF-TCO meta-ontology that uses much of the metashell: 
it uses both universal and existential restricted quantification, plus the conjunction and 
equivalence connectives. In order to define the metalanguage for a particular metalevel, 
the metashell language is typically used in two ways. First, the typing of the terminol- 
ogy for a metalanguage uses the metalanguage at the next higher metalevel, where the 
metashell expression is restricted to membership for sets, application for functions, and 
holds for binary relations [this is strictly category-theoretic]. Second, the meaning of the 
terminology for a metalanguage uses the metalanguage at the next higher metalevel [this 
includes unrestricted metashell (quantifiers, connectives, etc.)]. 

In addition to the metashell, there results a hierarchy of metalanguages (Figure [9]) 
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metashell 
meta-ur 
meta-top 
meta-upper 
meta-lower 



Figure 9: The IFF Metalanguage Hierarchy 

coordinated with the the metastack. Each metalanguage includes the ones above, with 
the metashell first order logical expression as innermost: the metashell is contained 
within the meta(4) = meta-ur core metalanguage, which is contained within the meta(3) 
= meta-top metalanguage, which is contained within the meta(2) = meta-upper metalan- 
guage, which is contained within the meta(l) = meta-lower metalanguage. Any object- 
level language uses the meta-lower metalanguage in its axiomatization. The meta-ur 
metalanguage is special — it axiomatizes the entire metastack. Visualizing the meta- 
language nesting in terms of the metastack, the meta-ur metalanguage acts as a central 
spindle oriented vertically (see Figure [2]), around which the metalanguages for the three 
other metalevels are ringed. 

11. The IFF Application 

Similar to software development, there are two approaches to the standardization of con- 
ceptual knowledge (ontology development): the monolithic approach and the modular 
approach. But standardization strongly affects maintenance, and in the maintenance of 
ontologies [8] the modular approach to conceptual knowledge representation has advan- 
tages over the monolithic. In addition, important distributed environments, such as the 
Semantic World-Wide Web and organizational intranets, represent their information in 
a modular fashion with multiple ontologies and schemas. For these reasons, the SUO 
advocates the modular approach in their lattice of theories (LOT) project. The goal of 
the LOT project is to create a framework "which can support an open-ended number of 
theories organized in a lattice together with systematic metalevel techniques for moving 
from one to another, for testing their adequacy for any given problem, and for mixing, 
matching, combining, and transforming them to whatever form is appropriate for what- 
ever problem anyone is trying to solve" (John F. Sowa, SUO archives, see also [18]). A 
challenge to the modular approach is the need for a semantic integration of ontologies. 

The main application of the IFF is institutional, involving ontology development and 
semantic integration. To be logic independent, the IFF represents and manipulates onto- 
logical structures within the metatheory of institutions [I] , which allows the formalization, 
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Figure 10: The IFF-FOL Module Hierarchy 



representation, implementation and translation of logics. The metatheory of institutions 
is being axiomatized in the upper metalevels of the IFF. Its institutional approach to 
logical semantics provides a principled framework for the modular design of object-level 
ontologies in general, and for the "lattice of theories" approach to ontological organiza- 
tion in particular. Within institutions, the lattice of theories is the fibring or indexing 
of the category of theories over the category of languages (aka signatures). Institutions 
formally express semantic integration as an ontological fusion process [9] : align ontologies 
within a diagram of theories and fuse aligned ontologies via the colimit of this diagram. 
The IFF has work-in-progress axiomatizations for the institutions and connecting institu- 
tion morphisms of information flow (IF), equational logic (EQL), order-sorted first order 
logic (FOL) and the common logic standard (CL). Although semantic integration can 
be represented within an arbitrary institution |9j, semantic integration in FOL-related 
institutions is of particular interest to the SUO project. The architecture of FOL involves 
a fundamental exomorphic-endomorphic distinction^!. The exomorphic aspect of FOL is 
needed for a principled formulation of the "lattice of theories" concept. 

The IFF First Order Logic (meta) Ontology ( IFF-FOL) gives a traditional axiomatiza- 
tion for FOL. The IFF-FOL is embedded within a very modular architecture as illustrated 
in Figure [10J This describes several institutions and institution morphisms, including 
EQN the institution for equational logic and FOL the institution for first order logic 
(with equality). The central bifurcation in Figure ITUl is between terms and expressions. 
FOL languages are the pullback of expression languages and term languages over (bi- 
jections of) variables. FOL languages with equality are the pullback of FOL languages 



28 Exomorphic refers to the architecture outside the notion of an FOL language, whereas endomorphic 
refers to the architecture within an FOL language. The exomorphic aspect of FOL is concerned with 
the institutional approach to FOL; this is an indexed category architecture based on substitution along 
language morphisms. The endomorphic aspect of FOL, which is formalized in various treatments of 
categorical logic (see [15] and [17]), is concerned with the hyperdoctrine approach to FOL; this is an 
indexed category architecture based on substitution along tuples in the Lawvere construction. 
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and universal algebra (equational languages) over term languages. Expression languages 
consist of relation (type) symbols and variables. To get FOL languages with equality, 
pullback relations along functions and equations. Term languages consist of function 
(type) symbols and variables. The Lawvere construction is defined here. Equations can 
be added giving equational languages (equational presentations) as an extension of term 
languages. They define a quotient of their Lawvere category. Expression languages consist 
of relation (type) symbols and variables. Peircian existential graphs can be included here. 
FOL languages consist of function (type) symbols, relation (type) symbols and variables. 
From the modular perspective of Figure [TDl an FOL language is a term language and 
an expression language that share a common set of variables. The important submod- 
ules in the FOL axiomatization are the following: a term/tuple fixpoint axiomatization, 
an expression/arity fixpoint axiomatization, the Lawvere construction axiomatization, an 
axiomatization for term-tuple coproducts, the term monad axiomatization and the expres- 
sion monad axiomatization. Each edge in the diagram of (Figure [TO]) is associated with 
institution morphisms in both directions, projection downward and inclusion upward. 

12. Summary and Future Work 

In this paper, we have presented the Information Flow Framework (IFF) as a descriptive 
category metatheory by discussing its design guidelines, development phases, architecture 
and institutional application. Meta-ontologies (i.e., metatheories), as well as dictionar- 
ies, are classified by the prescriptive-descriptive distinction. The development of meta- 
ontologies has strong analogies with the development of dictionaries. The IFF, which was 
the first of several projects accepted for ontology development under the auspices of the 
Standard Upper Ontology (SUO) working group, forms its structural aspect. The design 
guidelines make apparent that the IFF is being developed by following the intuitions of the 
working category-theorist. The IFF architecture, which consists of metalevels partitioned 
into namespaces, is structured by its namespace mechanism. The IFF metastack, which 
forms the heart of the IFF architecture, represents the small, the large, the very large and 
the generic. This is formalized in terms of four metalevels of sets, linked by functions and 
binary relations. The specialization between metalevels is effected by set subset, function 
restriction and relation abridgment. Three principles discovered during the development 
phases of the IFF are conceptual warrant, categorical design and institutional logic. The 
linguistic aspect of the IFF is represented by four nested metalanguages, with each meta- 
language linked to a metalevel. The metashell provides standard logical notation for the 
metalanguages with restricted quantification for typing. The main application of the IFF 
uses the metatheory of institutions to abstractly express the modular modeling technique 
of the lattice of theories framework used in the development and semantic integration of 
object-level ontologies. 

We anticipate that fibrations and indexed categories will be central to the fourth 
phase of IFF development. The large categories associated with many lower level meta- 
ontologies are fibered. Furthermore, indexed categories, fibrations and the Grothendieck 
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construction are very important to the metatheory of institutions, whose axiomatization 
is currently under development in the IFF. We also want to axiomatize sketches, and 
incorporate these with the axiomatization for institutions. Furthermore, it would be 
interesting to investigate the connections between higher order logic and the first order 
expression at the various IFF metalevels. Finally, we make two observations. The IFF 
regards category theory as a meta-ontology, which is exactly the spirit of the paper [2]. 
However, in a sense the IFF has outgrown its origins and institutional applications. It 
was originally based upon information flow and formal concept analysis with a slender 
category theory core. But now it has an increasingly strong categorically oriented super- 
structure. We propose to split the old IFF into (1) the descriptive category metatheory 
proper, calling this meta, and (2) a new smaller IFF that covers only the institutional 
application to ontology development and semantic integration. In addition, we propose the 
initiation of a working group for development of a category theory standard, as currently 
manifested in meta. 
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